Within-speaker variability of the word error rate for a continuous speech recognition system
نویسندگان
چکیده
The variance of the performance of a continuous speech recognition system subjected to replica utterances of the same sentence spoken by the same speaker has been investigated. In an experiment with three di erent speech recognition systems in three different languages with two di erent grammar conditions it is shown that the sentence word error rate has a variance that can be described in terms of binomial statistics. The distribution of the measured variance shows a remarkable correspondence to the parameterfree theoretical distribution. It is therefore concluded that for the word error rate of a continuous speech recognition system binomial statistics apply.
منابع مشابه
Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملError Analysis of Single Speaker Urdu Speech Recognition System
Speaker independent, spontaneous and continuous speech recognition system (ASR) can be integrated to other technologies like mobile to create an interface between technology and illiterate people so that they can use modern technologies. One of the major hurdles in such ASR is unacceptable word error rate. The paper explores the possibility of analyzing the Urdu speech corpus based on recogniti...
متن کاملModeling coarticulation in EMG-based continuous speech recognition
This paper discusses the use of surface electromyography for automatic speech recognition. Electromyographic signals captured at the facial muscles record the activity of the human articulatory apparatus and thus allow to trace back a speech signal even if it is spoken silently. Since speech is captured before it gets airborne, the resulting signal is not masked by ambient noise. The resulting ...
متن کاملVery fast adaptation for large vocabulary continuous speech recognition using eigenvoices
The principle of the eigenvoice method | using a priori knowledge on the speaker variability as collected during the training for a very fast adaptation | is applied to continuous speech recognition with large vocabulary. The handling of mixture density HMMmodels is discussed. For the case of gender independent models, a decrease of the word error rate of up to 15% is observed for unsupervised ...
متن کامل